CDS

Accession Number TCMCG058C32481
gbkey CDS
Protein Id KAF7154318.1
Location complement(join(47040705..47040743,47040885..47041023,47041126..47041370,47041829..47042150,47042414..47042638,47043548..47043714,47043828..47043944,47044368..47044541,47044667..47044762,47045011..47045227,47045322..47045440,47045657..47045849,47045981..47046132,47046788..47046923,47047048..47047145))
Organism Rhododendron simsii
locus_tag RHSIM_Rhsim01G0287000

Protein

Length 812aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA588298, BioSample:SAMN13241185
db_source WJXA01000001.1
Definition hypothetical protein RHSIM_Rhsim01G0287000 [Rhododendron simsii]
Locus_tag RHSIM_Rhsim01G0287000

EGGNOG-MAPPER Annotation

COG_category M
Description Sucrose-cleaving enzyme that provides UDP-glucose and fructose for various metabolic pathways
KEGG_TC -
KEGG_Module -
KEGG_Reaction R00806        [VIEW IN KEGG]
KEGG_rclass RC00005        [VIEW IN KEGG]
RC00028        [VIEW IN KEGG]
RC02748        [VIEW IN KEGG]
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
ko01003        [VIEW IN KEGG]
KEGG_ko ko:K00695        [VIEW IN KEGG]
EC 2.4.1.13        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway ko00500        [VIEW IN KEGG]
ko01100        [VIEW IN KEGG]
map00500        [VIEW IN KEGG]
map01100        [VIEW IN KEGG]
GOs GO:0003674        [VIEW IN EMBL-EBI]
GO:0003824        [VIEW IN EMBL-EBI]
GO:0008194        [VIEW IN EMBL-EBI]
GO:0016157        [VIEW IN EMBL-EBI]
GO:0016740        [VIEW IN EMBL-EBI]
GO:0016757        [VIEW IN EMBL-EBI]
GO:0016758        [VIEW IN EMBL-EBI]
GO:0035251        [VIEW IN EMBL-EBI]
GO:0046527        [VIEW IN EMBL-EBI]

Sequence

CDS:  
ATGGCTACACCTAAATTGTCGCGAATACCCAGTATGAGAGAGAGAGTCGAAGACACTCTCTCCGCTCACCGCAACGAACTCGTCTCTCTCCTCTCCAGGTATGTGGCTCAGGGGAAGGGGATGCTGCAGCCCCATCACCTGATCGACGAGCTTGATAAGGTAATCGGCGTCGATGACAGAGCCAATCTAACCCTCAGGGATGGTCCCTTCGGCGAAGTCCTCAAGTCCGCTCAGGAGGCCATAGTTCTGCCTCCATTTGTGGCTATAGCTGTTCGTCCAAGGCCTGGTGTTTGGGAATATGTGCGTGTCAATGTGCATGAACTAAGTGTGGAGCAGTTGAGTGTGTCCGAATATCTTCAGTTCAAAGAACAACTTGTGGATGGACACGTCAATGACAATTATGTGCTTGAGCTTGATTTGGAGCCTTTCAATGCATCATTTCCTCGCCCCACTCGATCTTCGTCGATTGGCAATGGTGTTCAATTCCTGAACCGACATCTCTCTTCAGTCATGTTCCGCAGCAAAGATTCTTTGGAGCCCTTACTTGATTTCCTCCGAGCACACAAGCATAAAGGGCAAGCAATGATGTTGAATGATCGCATGTATAGCATACCCAGACTTCAATCTGCATTGACAAAGGCAGAGGGTTATCTTTCTAAGCTACCAGCAGATACACCTTATTCTTTGTTGGAACATGACTTGCAAGGTATGGGTTTTGAAAGAGGTTGGGGTGACACTACAGAGCGGGTTTTGAAGATGATGCATCTTCTCATGGACATCCTCCAAGCTCCTGATCCCTCTGCCTTAGAGACCTTTCTTGGTAGAATACCAATGGTGTTTAACGTCGTCATTTTGTCAATTCATGGGTACTTTGGCCAAGCAAATGTCCTAGGCTTGCCTGACACTGGTGGTCAGGTCGTCTACATACTAGATCAAGTGCGTGCCCTGGAGAATGAAATGCTTATGAAAACAAAGCAGCAAGGACTGAATGTCACTCCTAGGATTCTTATAGTGACACGGTTGATACCTGATGCAAAAGGAACTTCATGCAACCAGCGGCTTGAAAGAATTAGTGGAACTAAACATTCACATATTCTGCGAGTTCCTTTTAGAACAGAAAAAGGAGTTCTTCGTAAATGGATCTCAAGGTTTGATGTATGGCCTTATCTGGAGAAATTTGCAGAGGATGCGGCTAGTGAAATTGCTTCTGAACTACAAGGTGTTCCAGATCTAATTATTGGCAACTATAGTGATGGAAATCTTGTTGCGTCTTTGTTATCTCATAGTATGGGAGTGACACAGTGCACCATTGCTCATGCATTGGAGAAAACGAAATATCCTGACTCAGACATATATTGGAAGAATTTTGAGGATAAATACCACTTCTCAGCCCAATTTACTGCTGATCTAATTGCCATGAATAATTCCGATTTTATCATCACCAGTACATACCAAGAGATCGCCGGAACGAAGAATACCGTTGGTCAGTATGAGAGCCATACAGCTTTCACACTTCCAGACCTGTACCGAGTTGTTCATGGCATTGATGTTTTTGATCCAAAGTTCAATATCGTGTCACCAGGGGCAGATATGTCCATTTATTTTCCGTACTCTGACAAGGAAAATAGGCTTACAGCCCTACATGGTTCAATTGAGAAGTTGTTATTTGGTCCTGAGCAAAATGAAGAGCATATTGGAACATTGAGTGATCCGTCAAAGCCCATAATCTTTTCCATGGCAAGGCTTGACCGTGTGAAAAACATCACAGGGCTGGTAGAGTGCTATGCTAAGAATACCAAGCTAAGGGAACTGGTAAACCTTGTTTTGGTAGCAGGTTACAATGATGTGAAGAAATCAAATGACAGAGAAGAAATTGCTGAAATTGAAAAGATGCACGGGCTTATGAAGGAATACAACTTGGATGGGCAGTTCCGATGGTTGTCATCTCAAACAAATCGAGCACGTAATGGCGAGCTATATCGCTACGTTGCTGACAAGAGAGGTGTTTTCGTGCAGCCTGCATTTTATGAAGCCTTTGGGCTTACAGTTGTGGAGGCCATGGTCAGTGGTCTTCCAACATTTGCCACTTGCCATGGTGGTCCTGCGGAGATTATTGAAGATGGAGTATCAGGGTTCCATATTGACCCCTACCACCCTGATGACGTTGCATCACGTTTAGCAGATTTTTTCCAACGGTGCAAGGAAGATCCCACCTACTGGGAGGAAATCTCTAAAGCTGGAATACAGAGGATCCTAGATAGGTACACATGGAAGATTTACTCTGAAAGGTTAATGACATTAGCTGGAGTGTATGGTTTCTGGAAGTATGTTTCAAAACTCGAGAGGCGTGAAACCCGGCGATACCTTGAGATGTTCTACATTCTCAAGTACCGTAATTTGGTAAAGTCCGTCCCTCTGCAAATCGACGAGGAACATTAG
Protein:  
MATPKLSRIPSMRERVEDTLSAHRNELVSLLSRYVAQGKGMLQPHHLIDELDKVIGVDDRANLTLRDGPFGEVLKSAQEAIVLPPFVAIAVRPRPGVWEYVRVNVHELSVEQLSVSEYLQFKEQLVDGHVNDNYVLELDLEPFNASFPRPTRSSSIGNGVQFLNRHLSSVMFRSKDSLEPLLDFLRAHKHKGQAMMLNDRMYSIPRLQSALTKAEGYLSKLPADTPYSLLEHDLQGMGFERGWGDTTERVLKMMHLLMDILQAPDPSALETFLGRIPMVFNVVILSIHGYFGQANVLGLPDTGGQVVYILDQVRALENEMLMKTKQQGLNVTPRILIVTRLIPDAKGTSCNQRLERISGTKHSHILRVPFRTEKGVLRKWISRFDVWPYLEKFAEDAASEIASELQGVPDLIIGNYSDGNLVASLLSHSMGVTQCTIAHALEKTKYPDSDIYWKNFEDKYHFSAQFTADLIAMNNSDFIITSTYQEIAGTKNTVGQYESHTAFTLPDLYRVVHGIDVFDPKFNIVSPGADMSIYFPYSDKENRLTALHGSIEKLLFGPEQNEEHIGTLSDPSKPIIFSMARLDRVKNITGLVECYAKNTKLRELVNLVLVAGYNDVKKSNDREEIAEIEKMHGLMKEYNLDGQFRWLSSQTNRARNGELYRYVADKRGVFVQPAFYEAFGLTVVEAMVSGLPTFATCHGGPAEIIEDGVSGFHIDPYHPDDVASRLADFFQRCKEDPTYWEEISKAGIQRILDRYTWKIYSERLMTLAGVYGFWKYVSKLERRETRRYLEMFYILKYRNLVKSVPLQIDEEH